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Abstract 

We study the Minimum Submodular-Cost Allocation problem (MSCA). In this problem we are given 
a finite ground set V and k non-negative submodular set functions fi, . . . , fk on V. The objective is to 
partition V into k (possibly empty) sets Ai, - ■ ■ ,Ak such that the sum Yli=i fii^i) is minimized. Several 
well-studied problems such as the non-metric facility location problem, multiway-cut in graphs and hyper- 
graphs, and uniform metric labeling and its generalizations can be shown to be special cases of MSCA. In 
this paper we consider a convex-programming relaxation obtained via the Lovasz-extension for submodular 
functions. This allows us to understand several previous relaxations and rounding procedures in a unified 
fashion and also develop new formulations and approximation algorithms for several problems. In partic- 
ular, we give a (1.5 — l/fc)-approximation for the hypergraph multiway partition problem. We also give a 
min{2(l — 1/fc), i?A}-approximationforthe hypergraph multiway cut problem when A is the maximum hy- 
peredge size. Both problems generalize the multiway cut problem in graphs and the hypergraph cut problem 
is approximation equivalent to the node-weighted multiway cut problem in graphs. 
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1 Introduction 



We consider the following allocation problem with submodular costs. 

Minimum Submodular-Cost Allocation (MSCA). Let V he a finite ground set and let /i, • • • , f^he k non- 
negative submodular set functions on V. That is, for 1 < i < A;, /j : 2^ — )• and fi{A) + fi{B) > 
fi{A U B) + fi{Ari B) for all A,B <ZV. In the MSCA problem the goal is to partition the ground set V into 
k (possibly empty) sets Ai, - ■ ■ , such that the sum Yl\=i fii^i) niinimized. 

We observe that the problem is interesting only if the /j's are different for otherwise allocating all of V 
to /i is trivially an optimal solution. We assume that the functions /j are given as value oracles, although in 
specific applications they may be available as explicit poly-time computable functions of some auxiliary input. 
The special case of this problem in which all of the functions are monotone {f{A) < f{B) if A <^ B) has been 
previously considered by Svitkina and Tardos |j22]. In this paper, we consider the problem with both monotone 
and non-monotone functions. We show that several well-studied problems such as non-metric facility location, 
multiway cut problems in graphs and hypergraphs, uniform metric labeling and its generalization to hub location 
among others can be cast as special cases of MSCA. In particular, we investigate the integrality gap of a simple 
and natural convex-programming relaxation for MSCA that is obtained via the use of the Lovasz extension of 
a submodular function. 

Lovasz extension and a convex program for MSCA: Let y be a finite ground set of cardinality n. Each 
real-valued set function on V corresponds to a function / : {0, 1}" — > M on the vertices of the n-dimensional 
hypercube. The Lovasz extension of / to the continuous domain [0, 1]*^ denoted by / is defined a^ 

/(x)= E [/(x^)j = /'/(xV 

where x^ € {0, 1}" for a given vector x G [0, 1]" is defined as: xf = lif Xi > 6 and otherwise. 

Lovasz showed that / is convex if and only if / is a submodular set function |17|. Moreover, it is easy to 
see that, given x, the value /(x) can be computed in polynomial time by using a value oracle for /. Via this 
extension, we obtain a straightforward relaxation for MSCA with a convex objective function and Unear con- 
straints. Let vi, - ■ ■ ,Vn denote the elements of V. The relaxation has variables x{v, i) foxv^V and 1 <i < k 
with the interpretation that x{v, i) is 1 if u is assigned to Ai and otherwise. Let Xj = {x{vi,i), • • • , i)). 
The relaxation is given below. 
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Throughout, we use OPT and OPTfrac to denote the value of an optimal integral and an optimal fractional 
solution to LE-Rel (respectively). 

We remark that LE-Rel can be solved in time that is polynomial in n and log (max^cv f{S)) via the 
ellipsoid method; we give some of the details in Appendix [A] Moreover, for some problems of interest the 

'The definition is not the standard one but is equivalent to it; see \24l or AppendixlAl This definition is convenient to us in describing 
and understanding rounding procedures. 
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above convex program can be rewritten into an equivalent linear program. We now describe several problems 
that can be cast as special cases of MSCA, and also how some previously considered linear-programming 
relaxations can be seen as being equivalent to the convex program above. 

1.1 Problems related to MSCA 

Monotone MSCA (Monotone-MSCA) and Facility Location: In faciUty location, we have a set of fa- 
cilities T and a set of clients or demands T). There is a non-negative cost cij to connect facility i to client 
2 (we do not necessarily assume that these costs form a metric). Opening facility i ^ T costs /j. The goal 
is to open a subset of the facilities and assign each client to an open facility so as to minimize the sum of the 
facility opening cost and the connection costs. Svitkina and Tardos ll22l considered the setting where the cost of 
opening a facility i is a monotone submodular function gi of the clients assigned to it, and gave an (1 + In |D|)- 
approximation, and matching hardness via a reduction from set cover. We note that this problem is equivalent 
to MSCA when all the /j are monotone submodular functions, which we refer to as MONOTONE-MSCA. In 
II22I a greedy algorithm via submodular function minimization is used to derive the approximation. Here we 
prove that the integrality gap of LE-Rel is (1 + In |D|), and describe how certain rounding algorithms achieve 
this bound. These algorithms are useful when considering functions that are not necessarily monotone. 

Submodular Multiway Partition (SUB-MP): We define an abstract problem and then specialize to known 
problems. Let / : 2^ — > M+ be a submodular set function over V and let S = {si, S2, • • • , Sfe} be k terminals 
in V . The submodular multiway partition problem is to find a partition of V into A\^. . . ^A^ such that Si £ Ai 
and Ym=i fi^i) minimized. This has been previously considered by Zhao, Nagamochi and Ibaraki |[27l . This 
can be seen as a special case of MSCA as follows. Define the ground set tobeV' = V\S and, for 1 < i < fc, 
fi : 2^' M+ is the function defined as fi{S) = f{SU{si}). If in addition / is symmetric {f{A) = f{V-A) 
for all A) we call this the symmetric SuB-MP problem (Sym-Sub-MP). Note that although the problem is 
based on a single function /, k different submodular functions (induced by the terminals) are needed to reduce 
it to MSCA. We now discuss some important special cases of this problem. 

Multiway Cut in Graphs (Graph-MC): The input is an edge-weighted undirected graph G = {V, E) and 
k terminal vertices S = {si, . . . , Sfc}; the goal is to remove a minimum- weight set of edges to disconnect the 
terminals. This can be seen as a special case of the symmetric submodular multiway partition problem by simply 
choosing / to be the cut-capacity function of G scaled down by a factor of 2. That is, f{A) = ^ ^^Zei^SiA) w{e) 
where w{e) is the weight of edge e. We observe that LE-Rel for this problem is equivalent to the well-known 
geometric LP relaxation of Calinescu, Karloff and Rabani ||2l, which led to significant improvements (1.5 — l/Zc 
in 13 and 1.3438 in |[T4l ) over the 2(1 — l/A;)-approximation obtained via the isolating-cut heuristic O. 

Multiway Cut and Partition in Hyper-Graphs: Given an edge-weighted hypergraph Q = {V, S) and terminal 
set S dV, the Hypergraph Multiway Cut problem (Hypergraph-MC) (see |[Tll|26l[8'|) asks for the 
minimum weight subset of hyperedges whose removal disconnects the terminals. This can be seen as a special 
case of SuB-MP ifTSll : this reduction requires some care and the underlying submodular function is asymmetric. 
A related problem is the HYPERGRAPH MULTIWAY PARTITION problem (Hypergraph-MP) introduced by 
Lawler [16 ] where the cost for hyperedge e is proportional to the number of non-trivial pieces it is partitioned 
into. This can be seen as a special case of the S YM-SUB-MP with / being the hypergraph cut capacity function. 
We note that Graph-MC is a special case of both Hypergraph-MC and Hypergraph-MP. 

Node-weighted Multiway Cut in Graphs (NODE-WT-MC): In this problem ||9l the graph has weights on nodes 
instead of edges and the goal is to find a minimum weight subset of nodes whose removal disconnects a given 
set of terminals. It is not difficult to show that Hypergraph-MC and NODE-WT-MC are approximation 
equivalent [ 18|. 

Zhao et al. [27] consider generalizations of the above problems where some set of terminals S C y and k 
are specified and the goal is to paitition V into k sets such that each set contains at least one terminal and the 
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total cost of the partition is minimized. We do not discuss these further since they are not directly related to 
MSCA, although one can reduce them to MSCA if A; is a fixed constant. 

Uniform Metric Labeling and Submodular Cost Labeling (Sub-Label): The metric labehng prob- 
lem was introduced by Kleinberg and Tardos 1 151 as a general classification problem. We are given an undi- 
rected edge-weighted graph G = {V,E) and k labels and the goal is to assign a label to each vertex to minimize 
the labeling cost and the edge-cut cost. Assigning label itov incurs a cost Ci{v) and if an edge uv of weight 
w{uv) has u labeled with i and v labeled with j then the edge-cut cost incurred is w{uv) ■ d{ij). The uniform 
metric labeling problem is obtained when d{ij) = 1 for all i ^ j. We consider the following generalization that 
we call the SuBMODULAR COST Labeling (Sub-Label) problem which is a special case of MSCA. The k 
labels correspond to the k functions /i, . . . , /fc. We define fi as the sum of two functions, a monotone function 
gi that models the label assignment cost, and a non-monotone function h that models the cut-cost. The goal 
then is to partition V into Ai, . . . , to minimize Yli=ii9i(.^i) + ^(^j))- Note that uniform metric labeling 
is the special case when are modular and h is the graph cut function, which is symmetric. We are motivated 
to consider this generalization by problems that have been considered previously, such as metric labeling on 
hypergraphs, hub location problem lITOl . and the extension of metric labehng to handle label opening costs ||5]|. 

1.2 Overview of Results and Techniques 

In this paper we examine the complexity of MSCA primarily through the "integrality gap" of the convex 
relaxation LE-Rel which can be optimized in polynomial time. All the problems we consider are NP-hard and 
our focus is on polynomial time approximation algorithms. 

A significant portion of our contribution is to highlight the naturalness of MSCA and the Lovasz-extension 
based relaxation LE-Rel by showing connections to previously studied problems, linear programming relax- 
ations, and rounding strategies. Viewing these problems in the more abstract setting of submodularity gives 
insights into prior algorithms. In the process, we obtain new and interesting results. Although one would like to 
obtain a single unifying algorithm that achieves a good approximation for MSCA, it turns out that LE-Rel has 
a large integrality gap and we believe that MSCA is hard to approximate to a polynomial factor. However, it is 
fruitful to examine special cases of MSCA that admit good approximations via LE-Rel. We describe several 
applications below by summarizing our results; all of them are based on LE-Rel. 

• The integrality gap of LE-Rel for MONOTONE-MSCA is 6(log n). 

• There is a (1.5 - l/A:)-approximation for Hypergraph-MP. 

• There is a min{2(l — 1/A;), ffA}-approximation for Hypergraph-MC, where A is the maximum hy- 
peredge size and Hi is the i-th harmonic number. For A = 2 this gives a 1.5-approximation and for 
A = 3 this gives a 1.833-approximation. 

• LE-Rel for Hypergraph-MC gives a new mathematical programming relaxation for Node-wt-MC 
and a new 2-approximation. Moreover, if all non-terminal nodes have degree at most 3 we obtain a 
1.833-approximation improving upon the 2(1 — 1/k) known via the distance-based relaxation [9|. 

• The integrality gap of LE-Rel for Sym-Sub-MP is at most 2 — 2/k; this gives an alternative approxi- 
mation to previous combinatorial algorithms ||T9]|27| . We raise the question as to whether the integrality 
gap is at most 1.5. 

• There is an O(logn) for Sub-Label when the cut function is symmetric. We derive results for other 
special cases of Sub-Label. 
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Rounding the convex relaxation: Recall that the objective function in LE-Rel is Yli=i hi^i)^ where fi{^i) = 
Eggjo 1] [/(xf )]. How do we round while preserving the objective function? If we focus on a specific i, the ob- 
jective function suggests that we pick 6 randomly from [0, 1] and assign the elements in xf to i; we call this 
0-rounding. However, there are two issues to contend with. First, if we independently round for each i then the 
same element may be assigned multiple times. Second, we need to ensure that all elements are assigned, which 
is not guaranteed by the 0-rounding. We remark that there is an integrality gap example for hypergraph metric 
labeling that shows that there is no effective rounding strategy that works in general. 

Our approach is to understand the rounding process by considering various special cases of interest. In par- 
ticular, we consider monotone functions, symmetric functions, the hypergraph separation cost function (which 
is asymmetric), and combinations of such functions. Monotonocity helps in that if elements are assigned to a 
label i, they can be removed without increasing the fractional cost. Although one can use different strategies 
to obtain an O (log n) -approximation and integrality gap, a useful strategy here is the rounding of Kleinberg 
and Tardos that they introduced for metric labeling. This has the additional property of ensuring that an 
element u is assigned to i with probability exactly x{u, i). We then consider the rounding process for SUB-MP, 
in particular the symmetric case S YM-SUB-MP. Here, we crucially take advantage of the fact that there is a 
single underlying function /, and moreover the fact that it is symmetric. We consider the CKR-ROUNDING 
strategy from [2] and show its effectiveness for hypergraphs by abstracting away some of the properties specific 
to graphs that were previously exploited in the analysis. In the process, we also observe that a variant is equally 
effective for graphs but is more insightful for S YM-SuB-MP. 

Finally, Sub-Label combines a monotone function and a non-monotone function. Here, we resort to KT- 
ROUNDING since it is a reasonable strategy to approximately preserve the cost of the monotone component. 
For the uniform metric labeling problem, [15] showed that KT- ROUNDING approximately (to within a factor of 
2) preserves the fractional connection cost in the case of graphs. We show bounds for hypergraph cut functions 
in an analogous fashion. Our insights enable us to develop a variant of the rounding that gives an O(logn)- 
approximation for Sub-Label when the cut function is an arbitrary symmetric submodular function. 

Other Related Work: There has been much recent interest in optimizing with submodular set functions. In 
particular, maximization problems have been examined via combinatorial techniques as well as the multilinear 
relaxation ||l]. The submodular welfare problem [|23l is similar in spirit to MSC A except that one is interested in 
maximizing the value of an allocation rather than minimizing the cost. Minimization problems with submodular 
costs have also received substantial attention [20l[l2l|T3][II] with several negative results for basic problems as 
well as positive approximation results for problems such as the submodular cost vertex cover problem [ 13 HI]- 
Lovasz-extension based convex programs have been effectively used for these problems. Various submodular 
cut and partition problems and their special cases such as the hypergraph cut and partition have been studied 
recently 1*271 l26l[T8l [81: however, these papers have typically focussed on greedy and divide-and-conquer based 
approaches while we use LE-Rel. 

Recent Results for Sym-Sub-MP and SUB-MP: Very recently, building on the work in this paper and a 
non-trivial new technical theorem, we showed JS] that the integrality gap of SubMP-Rel is at most 1.5 — l/k 
for Sym-Sub-MP and at most 2 for SuB-MP. 

2 Monotone MSCA 

In this section we consider MONOTONE-MSCA where /i, . . . , are monotone submodular functions. We 
will assume for simplicity that /j(0) = for all i. Svitkina and Tardos ll22l considered this problem in the 
context of facility location and gave a (1 + Inn) -approximation and matching hardness via an approximation 
preserving reduction from set cover. Let a = min5cy,i<i<fc fi{S)/\S\. The main observation in [22] is that 
a < OPT/n, and moreover a pair {S,i) such that fi{S)/\S\ = a can be computed in polynomial time via 
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submodular function minimization. One can then iterate using a greedy scheme, by using the monotonicity 
of the functions, to obtain a (1 + lnn)-approximation. Using a similar argument, we can prove the following 
theorem. 

Theorem 2.1. The integrality gap of LE-Rel /or MONOTONE-MSCA is at most (1 + Inn). In particular, 
a < OPTFRAc/?^■ 



Monotone-MSCA-Greedy: 

let a; be a solution to LE-Rel 

Ai <— for alH (1 < i < k) {{the set of vertices that will be assigned to i)) 

U ■(^ V {{the set of unassigned vertices)) 

while U ^9 

let X be the restriction of a: to [/ 

for each 9, let A{i,9) ^ {v \ v £ U, x{v, i) > 9} 

let = 9i^i < 6*; 2 < • • • < 9i,i. = 1 be the distinct entries of 

let {i' , j') be the pair of indices in the set {(i, j) | 1 < i < k,l < j < £i} 

that minimizes the ratio fi{A(i, 9ij))/\A{i, 9ij)\ 
A,, ^ A,,UA{i',9,,j,) 

U ~A{i',9,,,,,) 



Theorem 2.2. Monotone-MSCA-Greedy achieves an Hn-approximation for MONOTONE-MSCA. 



Before we prove Theorem 12.21 we introduce some notation. Consider iteration of MONOTONE-MSCA- 
Greedy. Consider an iteration of the while loop of Monotone-MSCA-Greedy. Let U be the set of 
elements that are unassigned at the beginning of the iteration, and let x denote the restriction of x to U; more pre- 
cisely, x{v, i) = x{v, i) for all terminals i and all vertices w G [/. For any 9, Let A{i, 9) = {v \v & x{v, i) > 
9}. Let = 9i^i < 9i^2 < ■ ■ ■ < 9i/. = 1 be the distinct entries of Xj. Let OPTfrac = Yli=i fii'^i)- Theo- 
rem |2]2] follows from the following lemma. 



Lemma 2.3. 



mm mm 

l<i<kO<j<ei 



h{A{i,9,j)) ^ OPT 

FR 

\A{i,9,,,)\ -^U\ 



In order to prove Lemma 1231 we will show that, if we choose a terminal i € {1,2, - ■ ■ ,k} and a threshold 
9 G [0, 1] uniformly at random, the ratio E[fi{A{i, e))]/E[\A{i, e)\] is at most OPTpRAc/lf^l- The following 
propositions give a bound on the two expectations K[fi{A{i,6))] and E[[j4(i, 6*)!]; their proofs are relatively 
straightforward and they have been moved to Appendix |B] 



Proposition 2.4. 



Proposition 2.5. 



E[fi{A{i,9))] < ioPTpRAc. 

i,8 k 



m\A{i,e)\ 

t,0 



Proof of Lemma I2.3t Let i be a terminal selected uniformly at random. Let be a threshold selected uni- 
formly at random from the interval [0, 1]. It follows from Proposition 12.41 and Proposition 12.51 that the ratio 
K[fi{A{i, 9))]/E[\A{i, 9)] is at most OPTfrac/|C^|- By linearity of expectation. 
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and therefore there exists a terminal i' and a tlireshold 6' for which the ratio fii{A{i', 6'))/\A{i' , 9')\ is at most 
OPTpRAc/lf^l- Let j' be the smallest index j that satisfies < j < ii' and Oi/j > 9'. Since A{i',6iiji) = 
A{i', 9'), is the desired pair. □ 

In the remainder of this section, we consider a different algorithm that achieves an 0(log n)-approximation for 
MONOTONE-MSCA. We will use this algorithm as a building block for submodular cost labeling algorithms 
(see Section |4|. The algorithm KT- ROUNDING is derived from the work of Kleinberg and Tardos on metric 
labeling lITSl . 



KT-ROUNDING 

let a; be a solution to LE-Rel 

iS* ((set of all assigned vertices)) 

{{set of vertices that are eventually assigned to i)) 

A, ^9 for alH (1 < i < k) 

while 5 7^ y 

pick i G {1,2, - - ,k} uniformly at random 
pick 6 G [0,1] uniformly at random 
A,^A,U {{v I x{v, i)>e}~ S) 
S ^SUA, 
return {Ai, ■■■ ,Ak) 



We prove the following theorem by building on some useful properties that are shown in [15 1; one of these is 
that the probability that v gets assigned to i in the rounding is precisely x{v,i). In particular, this yields an 
optimal algorithm for modular functions. 

Theorem 2.6. KT-Rounding achieves a randomized O (Inn) -approximation for MONOTONE MSCA. 

Proof Sketch: It is shown in j flSl . and not difficult to see, that the rounding terminates in 0{k log n) iterations 
of the while loop with high probability. In each iteration the algorithm does a ^-rounding on an index chosen 
uniformly at random. Let i be the random index and A{i, 9) = {v \ x{v, i) > 0}. Then it is easy to see that 
K[fi{A{i, 9))] = Yli=i \ fi{^i) — iOPTpRAc- Since the functions are monotone, we have that E[/(A(i, 9) — 
S)] < iOPT 

FRAC- Since the algorithm terminates in Oik log n) iterations, by linearity of expectation and the 
sub-additivity of the functions (since the functions are submodular and /(0) = 0), the total expected cost is 
0(logn)OPTpRAc. □ 



3 Submodular Multiway Partition 

We consider MSCA when the /j can be non-monotone. We can show that the integrality gap of LE-Rel 
even for a special case of labeling on hypergraphs can be 0(n), and we suspect that the problem is hard to 
approximate to a polynomial factor in n. We therefore focus on SUBMODULAR MULTIWAY PARTITION (SUB- 
MP) and SUBMODULAR Cost Labeling (Sub-Label); these are broad special cases which capture several 
problems that have been considered previously. 

The reduction of SUB-MP to MSCA requires one to work with the non-terminals V as the ground set. 
It is however more convenient to work with the terminals and non- terminals. In particular, we work with the 
relaxation below. Recall that Xj = {x{vi,i), • • • , i)). 
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As before, a starting point for rounding the relaxation is the basic ^-rounding that preserves the objective 
function. Suppose we do 0-rounding for each i to obtain sets A{1,6), . . . , A{k, 9) where each A{i, 6) C V. 
Here we could use independent random 9 values for each i or the same 9. Note that the constraints ensure that 
Si € A{j, 9) = j. However, the sets ^4(1, 6*), . . . , A{k, 9) may intersect and also may not cover the entire 
set V, in which case we have to allocate the remaining elements in some fashion. First we show how to take 
advantage of the case when / is symmetric and then discuss how to obtain results for hypergraph problems that 
are special cases of SUB-MP. 

A 2(1 - 1/ A;) -approximation for Sym-Sub-MP: A 2(1 - l/A;)-approximation for Sym-Sub-MP is known 
via greedy combinatorial algorithms ||T9] |27| . However, no mathematical programming formulation for the 
problem has been previously considered. Here we show that, on instances of Sym-Sub-MP, the integrality 
gap of LE-Rel is 2(1 — l//c) by using an uncrossing property of symmetric functions. 

The following lemma is standard and it has been used in previous work ll2n . 

Lemma 3.1. Let f be a symmetric submodular set function over V and let Ai, . . . , Aj. be subsets ofV. Then 
there exist sets A'^, . . . , A'f^ such that ( i) A'- C Ai for 1 < i < k, ( ii) A'^, . . . , A'f^ are mutually disjoint ( Hi) 
Uj^- = UiAi and (iv) J2i fi^'i) ^ J2i fi^i)- Moreover, given the Ai's a collection of sets A'- satisfying the 
above properties can be found in polynomial time via a value oracle for f. 

Proof: Since / is symmetric, it satisfies posi-modularity; that is, f{X) + f{Y) > f{X — Y) + f{Y — X). 
From this we see that either f{X) + f{Y - X) or f{Y) + f{X - Y) is no larger than f{X) + f{Y). This 
allows us to uncross ^i, . . . , ^4^ as follows. If the Ai's are mutually disjoint then we can set A'- = Ai for each 
i and they satisfy the desired properties. Otherwise, there exist distinct i and j such that Ai n Aj ^ 0. We can 
replace Ai and Aj with Ai and Aj — Ai if f{A.i) + f{Aj — Ai) < f{Ai) + f{Aj); otherwise, we replace them 
by Ai — Aj and Aj. We repeat this process to get the desired sets. □ 

Theorem 3.2. The integrality gap of LE-Rel /or Sym-Sub-MP is < 2(1 - 1/k). 

Proof: Let X be an optimal solution to LE-Rel for a given instance of Sym-Sub-MP. Let A(1, 9),. . . ,A{k, 9) 
be sets obtained by applying ^-rounding to each i. By the property of ^-rounding, we observe that E[/(yl(i, 9))] = 
J2if{^i) = OPTpRAc- Note that Si belongs only to A{i, 9). We now apply Lemma |311 to ^(1, 6*), . . . , A{k, 9) 
to obtain A{, . . . , A'^. We have ^ . f{A'-) < J2^ fiMh f ))■ Let V = V - UiA'-. By symmetry of /, f{V') = 
fiUiA'i) and, since / is sub-additive, f{V') = /(Ui^Q < < E»/(^(^>^))- We can allocate V 

to any index i and the total cost of the allocation is upper bounded by f{V') + J2i fi^i) ^ 2 f{A{i, 9)). 
Thus the expected cost of the allocation is at most 20PTprac- The allocation is feasible since Sj belongs only 
to A{i, 9) and hence to A[. One can refine this argument slightly to obtain a 2(1 — l/fe) bound; we give the 
details in Appendix O □ 

In a previous version of this paper, we raised the following question. 

Question. Is the integrahty gap of LE-Rel for Sym-Sub-MP at most 1.5? 

As we already noted, we have shown in subsequent work ||3] that the integrality gap is at most 1.5 — l/Zc. 



7 



Rounding for Hypergraph-MC and Hypergraph-MP: Calinescu et al. El gave a new geometric relax- 
ation for Graph-MC, and a rounding procedure that gave a (1.5 — l/fc)-approximation; the integrality gap was 
subsequently improved to a bound of 1.3438 — Sk in [14], while the best known lower bound is 8/(7 + l//c — 1) 
U]. Calinescu et al. [2] derived their relaxation as a way to improve the integrality gap of 2(1 — 1/fc) for a natu- 
ral distance based linear programming relaxation; in fact, it often goes unnoticed that [2] shows the equivalence 
of their geometric relaxation to that of another relaxation obtained by adding valid strengthening constraints to 
the distance based relaxation. Interestingly, when we specialize MSCA to Graph-MC, LE-Rel becomes the 
geometric relaxation of [2|! The rounding procedure in [2 | can be naturally extended to rounding LE-Rel for 
SUB-MP and we describe it below. 



CKR-ROUNDING 

let a; be a solution to SubMP-Rel 

pick a random permutation 7rof{l,2,-- - ,k} 

pick 9 £ [0, 1) uniformly at random 

S* {{set of all assigned vertices)) 

for i = 1 to fc — 1 

^i{v\x{v,T:{i)) >e}-S) 
S^SU yl^(,) 

return (Ai, ■ ■ ■ ,Ak) 



CKR-RoUNDiNG uses the same 9 for all i and a random permutation, both of which are crucially used in the 
1.5-approximation analysis for Graph-MC. In this paper we investigate CKR-RoUNDiNG and other round- 
ings for Hypergraph-MC and Hypergraph-MP. 

Although Hypergraph-MC and Hypergraph-MP appear similar, their objective functions are different. 
The objective of Hypergraph-MC is to remove a minimum weight subset of hyperedges such that the ter- 
minals are separated, whereas the objective of Hypergraph-MP is to minimize J2e'^(^)p(^)' where p{e) 
is the number of non-trivial parts that e is partitioned into (a part is non-trivial if some vertex of e is in that 
part but not all of e). For graphs we have that either p{e) = or p{e) = 2, and therefore the two problems 
Hypergraph-MC and Hypergraph-MP are equivalent; this is the reason why one can view Graph-MC 
as a partition problem as well. However, when the hyperedges can have size larger than 2, the objective function 
values are not related to each other (it is easy to see that the Hypergraph-MP objective is always larger). 

Hypergraph-MP and Hypergraph-MC have been studied for their theoretical interest and their ap- 
phcations. It is easy to see from its definition that Hypergraph-MP is a special case of Sym-Sub-MP. It 
has been observed by a simple yet nice reduction lITSl that Hypergraph-MC is a special case of SuB-MP. 
In addition, it has been observed that Hypergraph-MC is approximation-equivalent to the noJe-weighted 
multiway cut problem in graphs (NODE-WT-MC) [9'1. 

We show that CKR-RouNDiNG gives a (1.5 - l/A;)-approximation to Hypergraph-MP and a tight Ha- 
approximation for Hypergraph-MC with maximum hyperedge size A. Note that when A = 2, H^. = 1-5 
and when A = 3, H^. — 1.833. For A > 3, CKR-RoUNDiNG gives a worse than 2 bound while we give an 
alternate rounding which gives a 2(1 — l/A;)-approximation. Our analysis of CKR-RouNDiNG differs from 
that in [2] since we cannot use the "edge alignment" properties of the fractional solution that hold for graphs 
and were exploited in \2^; our analysis for Hypergraph-MC is inspired by the proof given by Williamson 
and Shmoys |[25l . 

It is natural to wonder whether CKR-RoUNDiNG is crucial to obtaining a bound that is better than 2 for 
these problems, and in particular whether it gives a 1.5-approximation for Sym-Sub-MP. We show that a 
1.5 — 1/ /c-approximation for Hypergraph-MP (and hence Graph-MC also) can be obtained via a different 
algorithm as well; in particular, the crucial ingredient in CKR-RoUNDiNG for Graph-MC when viewed as 
a special case of Hypergraph-MP is the correlation provided by the use of the same 9 for all i; one can 



8 



replace the random permutation by the uncrossing scheme in Lemma [3?T] We describe this algorithm in the 
next section. However, for Hypergraph-MC, the random permutation is important in proving the i^A-bound. 



3.1 A 1.5-approximation for Hypergraph Multiway Partition 

We start by understanding the objective function of SubMP-Rel in the context of Hypergraph-MP. Let 
X be a feasible fractional solution, and let Xj = {x{vi,i), ■ ■ ■ ,x{vn,i)) be the allocation to i. Recall that 
/ here is the hypergraph cut function. What is Y17=i /(^i)^ For ^^'^h terminal i and each hyperedge e, let 

I{e, i) = [min^jge x(v^ i), max^ge x{v, i)]. Let (i(e, i) denote the length of /(e, i), and let d{e) = Yl\=i ^(^' 0- 
Note that d{e) S [0, |e|]. 

Lemma 3.3. ELi /(^i) = Ee'f«(e)4e)- 

Proof: Consider a hyperedge e. Let A{i^ 6) be the set whose characteristic vector is x^. For each G 
[Q,mm.y^f,x{v,i)\, the set A{i,6) contains all the vertices of e, and thus e ^ 5{A{i,6)). For each 6 £ 
(miiitige z), max^jge , i)], the set A{i,9) contains at least one vertex of e but not all of the vertices 
of e, and thus e G 5{A{i,9)). Finally, for each 9 G (max^jge i), 1], the set A{i,9) does not contain any 
vertex of e, and thus e ^ 5{A{i,9)). Therefore the contribution of e to /(xj) is equal to (maxt,ge i) — 
min^^e x{v, i))w{e) = d{e,i)w{e). □ 

A crucial technical lemma that we need is the following which states that the contribution of any i to d{e) is at 
most die) 12. 

Lemma 3.4. For any i, d{e,i) < d{e)/2. 

Proof: Let u = argmaxj,gea;(f , i), and w = argmin^gga;(f, i). We have 

d{e,i) = x{u,i) — x^Wji) 

1 - j) j - 1 

= ^{xiw.j) - x{u,3)) 
< y ( max X (v, j) — minx {v, j) 

= d{e) — d(e, i) 

Therefore d{e, i) < d{e) /2. □ 

The algorithm S ymSubMP-Rounding that we analyze is described below. We can prove that CKR-ROUNDING 
gives the same bound; however, SymSubMP-Rounding and its analysis are perhaps more intuitive in the 
context of symmetric functions. The algorithm does 0-rounding to obtain sets A{1,9), . . . , A{k, 9) and then 
uncrosses these sets to make them disjoint without increasing the expected cost (see Lemma ISTI ). 
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SymSubMP-Rounding 

let x be a feasible solution to SubMP-Rel 
pick G [0, 1] uniformly at random 

A{i, e) ^ {v\ i) > 9} for each i {I < i < k) 
{{uncross A{1,0),--- ,A{k,0))) 
A'i ^ A{i, 9) for each i (1 < i < fc) 
while there exist i ^ j such that A'^ r\ A'^ ^ % 
\f{f{A'^) + f{A'^~A',)<f{A[) + f{A-)) 



else 

A'. 

return {A'l,- ■ 



A'j A'^ 



A' - A' 



,A',_„V-{A[U---A',_,)) 



Theorem 3.5. SymSubMP-Rounding achieves an 1.5-approximation for Hypergraph-MP. 

We remark that we can change the algorithm and the analysis slightly to achieve a (1.5 — l/A;)-approximation; 
we give the details in Appendix iDl 

Lemma 3.6. Let i* be the index such that the interval I{e,i*) has the rightmost ending point among the 
intervals I{e,i). More precisely, I{e,i*) is an interval such that max^gg i*) = maxj max^ge , i); if 
there are several such intervals, we choose one arbitrarily. Let be an indicator random variable equal to 1 
iffe G 5{V - {A{1, 6*) U • • • U A{k, 6))). Then E[Ze] < d{e, i*). 

Proof: Note that is equal to 1 only if (1) for any terminal i, 9 is at least mint,ge x{v, i) and (2) there exists a 
terminal i such that 9 € /(e, t). If there exists a terminal i such that 9 is smaller than min^jge x{v, i), all of the 
vertices of e are in A{i, 9). If there does not exist a terminal I such that 9 € /(e, I), either all of the vertices of e 
are in A{i, 9) for some i or all of the vertices of e are mV — {A{1, 6*) U • • • U A{k, 9)). Finally, we note that (1) 
and (2) imply that 9 is in /(e, i*). By (2), 9 is at most maxt,ge x{v, i*) and, by (1), 9 is at least mint,ge x{v, i*). 

□ 

Proof of Theorem I3.5t It follows from the sub-additivity of / and Lemma 13.11 that the cost of the partition 
returned by SymSubMP-Rounding is at most 

k k 

fi^i) + f{V - K U • • • U ^^)) < f{Aii, 9)) + fiV - {Ail, 9)U---U A{k, 9))) 

i=l i=l 

Let OPTpRAc = J2i=i By Lemma [3^ and Lemma [34l 

EifiV - {Ail, 9)U...U Aik, 9)))] < E ^^^^ = 



Finally, ]E[X^j=i fi^^ih ^))] = OPTfrac. and therefore the expected cost of the allocation is at most I.SOPTfrac- 

□ 



3.2 Algorithms for Hypergraph Multiway Cut 

Now we consider Hypergraph-MC. For each hyperedge e, pick an arbitrary representative node r(e) G e. 
Define the function / : 2^ — )• ]R_|_ as follows: for A C V, let /{A) = Yle-r{e)&A egA'^(^) weight 
of hyperedges whose representatives are in A and they cross A. It is easy to verify that / is asymmetric and 
submodular. SUB-MP with this function / captures Hypergraph-MC |[T8l . 

Let X be a feasible fractional allocation and Xj be the allocation for i. For each hyperedge e and each 
terminal i, let I{e, i) = [min^ge x{v, i), max^gg x{v, i)]. Let d{e, i) = x{r{e),i) — min^ge x{v, i) and d{e) = 
ELi d{e,i). 
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Lemma 3.7. Eti />i) = Ee ^(e)rf(e). 

Proof: Consider a hyperedge e. Let A{i, 6) be the set whose characteristic vector is . For each 6 G 
[0, min^ge i)], the set A{i,6) contains all the vertices of e and therefore e ^ 5{A{i,6)). For each 
6 € (min„ge x{v, i),x{r{e),i)\, the set A{i, 6) contains the representative r(e) of e and e G (5(j4(i, 0)). Finally, 
for each € (x(r(e), i), 1], the set A{i, 6) does not contain the representative r(e). Therefore the contribution 
of e to /(xj) is equal to {x{r{e),i) — mint,ge i))w{e) = d{e, i)w{e). □ 



Half-Rounding 

let x be a solution to SubMP-Rel 
pick 6* e (1/2, 1] uniformly at random 
for i = 1 to fc — 1 

A{i,9) ^ {v I x{v,i) > 9} 

u(e) - 6*) u • • • u A(fc - 1, e)) 

return (A(l, 6>), ■ ■ ■ , A(fc - 1, 6^), ^/(g)) 



The main goal of this section is to show that HALF-RoUNDiNG is a 2-approximation, and CKR-ROUNDING 
is a i^A-approximation for Hypergraph-MC, where A is the maximum hyperedge size. We remark that we 
can show that the integrality gap of LE-Rel is at most 2(1 — 1 /k) using a connection between the distance LP 
for NODE-WT-MC considered in [9] and LE-Rel for Hypergraph-MC; we give the details in Appendix El 

Lemma 3.8. Let e be any hyperedge, and let z he any vertex in e. Let R{z) = {i \ x{z^ i) = maxy^^, x{v, i)}. 
Then J2ieR{z) 1-^(6,01 < d{e). 

Proof: Let u be the representative of e. If z = u, the lemma is immediate. Therefore we may assume that 
z u. We partition {1, 2, • • • , k} into two sets: the set S{z) consisting of all coordinates i such that x{z, i) is 
smaller than x{u, i), and the set B{z) consisting of all coordinates i such that x{z, i) is at least x(n, i). Since 
E^=i^('2,i) and Yl\=ix{u^i) are both equal to 1, the total difference between x{u,i) and x{z,i) over all 
coordinates i G S{z) is equal to the total difference between x{z, i) and x{u, i) over all coordinates i G B{z). 
Therefore 

^ {x{z,i) - x{u,i)) = ^ {x{u,i) - x{z,i)) 

< y { x(u,i) — rninxiv, I 

< die) — y ( x(n, i) — minx(v, z) ) 

i&B{z) 

Since R{z) is a subset of B{z), the lemma follows. □ 
Corollary 3.9. For each i, the length of the interval /(e, i) is at most d[e). 

Proof: Let /3 — max^_j^ max^ge xiv, i). Let be a terminal and let 6 be a vertex in e such that x{h, £) — /3. 
Since £ is in R{b), the corollary follows from Lemma |3^ □ 

Theorem 3.10. Let F be the set of all hyperedges crossing the partition returned by Half-ROUNDING. For 

each hyperedge e, Pr[e G F] < 2d{e). 

Proof: Let /(e, i*) be the interval with the rightmost right interval among the intervals /(e, 1), • • • , /(e, k); if 
there are several such intervals, we pick one arbitrarily. Note that e is in F only if 6 is in the interval I{e,i*). 
Therefore the probability that e is in F is at most 2|/(e, By Corollarv 13.91 the length of /(e, i*) is at most 
d{e). □ 
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Theorem 3.11. Let F be the set of all hyperedges crossing the partition returned by CKR-ROUNDING. For 

each hyperedge e, Pr[e G F] < H\f,y d{e). 

Proof Sketch: We say that Si splits e if ^ € I(e, i). Let Xi be the event that Si sphts e. We say that Si touches 
eifO < max^ge x{v, i). (Note that max^jge x{v, i) is the right endpoint of the interval /(e, i).) We say that Si 
settles e if Sj is the first terminal in the permutation vr that touches e. Let be the event that Sj settles e. 

Note that the edge e is in F only if there is a terminal Si that sphts and settles e. Therefore we can upper 
bound the probability that e is in F by Ya=i ^A^i ^ ^d- 

We relabel the terminals so that the ordering of the intervals {/(e, i)}i<i<k from right to left according to 
their ending point is /(e, 1), I(e, 2), • • • , /(e, A;). (If there are several intervals with the same ending point, we 
break ties arbitrarily.) After relabeling the intervals, Si settles e only if, for each j < i, 7r(sj) < vr(sj). This 
observation, together with the fact that Sj splits e with probability |/(e, implies that Pr[Xj Al^] < |/(e, 

Finally, let L(z) = {i \ i) = max^gg x{v, i)}. If an index i belongs to more than one set L(z), we only 
add i to one of the sets (chosen arbitrarily). Note that, by Lemma |3T8l the total length of the intervals I{e, i) 
where i G L{z) is at most d{e). This, together with the fact that the sets L{z) are disjoint and their union is 
{1, 2, • • • , k}, implies that Yh^i Pr[Xj A Y,-^ is at most H\f,yd{e). □ 

Proposition 3.12. The analysis in Theorem \3.11\ is tight. 

Proof: Let e be a hyperedge with representative u. Let e G (0, 1) be such that e[e| < 1. Consider a solution 
X that assigns the following values to the vertices of e. For each terminal i > \e\ and each vertex z € e, 
we have x{z,i) = 0. For each terminal i such that 1 < i < |e|, we have x{u,i) = (|e| — i)e. Finally, 
x{u, 1) = 1 — Yli=2 ^(^' Let U2, • ■ ■ ; dcnotc the remaining vertices of e (other than n). Now consider 
an index j such that 2 < j < |e|. We have x{vj,l) = e, x{vj,j) = x{u,j) — e, and x{vj,i) = x{u,i) for 
all i / j. Note that Yli=i x{v, i) is equal to 1 for all vertices v € e and the distance d{e) is equal to e. It is 
straightforward to verify that e is in F with probability at least H\^yd{e). □ 

4 Submodular Cost Labeling 

In this section we consider Sub-Label, which generalizes Monotone-MSCA, uniform metric labeling, hub 
location, and other problems. A natural algorithm here is KT-ROUNDING, which we have already introduced 
in Section 121 We also describe a different algorithm, S ymSubLabel-Rounding, which is appropriate for 
Sub-Label when the cut function is an arbitrary symmetric submodular function. We obtain several results 
that we state below. 

The next two results consider the Sub-Label problem on hypergraphs in which h is the following function. 
For each edge hyperedge e, pick an arbitrary representative node r (e) € e. We define the function /i : 2^ — )• 
as follows: for A C.V, let f{A) = '}2e-r{e)(^A egA ^(s) be the weight of hyperedges whose representatives are 
in A and they cross A. We refer to this function as the hypergraph separation cost function. 

Theorem 4.1. Ifh is the hypergraph separation cost function and each gi is modular, KT- ROUNDING achieves 
a A-approximation for Sub-LABEL. 

Theorem 4.2. If h is the hypergraph separation cost function and each gi is a monotone submodular function, 
KT-RoUNDiNG achieves an 0(lnn + A) approximation for Sub-Label. 
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SymSubLabel-Rounding 



let a; be a solution to LE-Rel 

S <— (A ((set of all assigned vertices)) 

^ for alH (1 < i < k) 
while S y^V 



pick i G {1,2, - ■ ■ ,k} uniformly at random 
pick 9 E [0,1] uniformly at random 

A, ^ A,U{v\ x{v, i) > 6} 



S ^ SU{v\ x{v,i) > 9} 
{{uncross Ai, - ■ ■ ,Ak)) 
A'i ^ A, for all i (1 < i < fc) 
while there exist i ^ j such that A[ n Aj ^ $ 



if if {A\) + f {A'^ ~ A'i) <f {A',} + f{A'^)) 



J J '' 



else 



return {A'-^,- ■ ■ , A'f^) 



Theorem 4.3. If h is a symmetric submodular function and each gi is a monotone submodular function, 
SymSubLabel-ROUNDING achieves an O(lnn) approximation for Sub-Label. 

Let PARTpRAc = Y!i=i H^i) be the partition cost of LE-Rel, and let COSTfrac = Yli=i 9i(.^-i) be the 
assignment cost of LE-Rel. 

Consider the Sub-Label problem in which h is the hypergraph separation cost function. For each hyper- 
edge e, let d{e) = J2i=i{^(.^i^)'''') ~ ™int)6e x{v, i)). By Lemma [377l PARTfrac = J2e w{e)d{e). In order to 
bound the expected partition cost of the labeling constructed by KT- ROUNDING, we consider each hyperedge 
separately, and we give an upper bound on the probability that the hyperedge has at least two vertices with dif- 
ferent labels. We say that a hyperedge e is split in some iteration of KT-ROUNDING if there exists an iteration 
I such that at least one vertex of e is assigned a label in iteration C but not all vertices of e are assigned a label 
in iteration The following lemma gives an upper bound on the probability that a hyperedge e is split. 

Lemma 4.4. For each hyperedge e, the probability that e is split is at most Ad{e). 

Using Lemma l4!4l we can complete the proofs of Theorem 14. H and Theorem 14. 21 as follows. 

Proof of Theorem |4H Let (Ai, • • • , A^) be the partition returned by KT-RoUNDiNG, and let PARTjnt = 
Yli=iK^i) and COSTiNT = J2i=i9ii^i)- shown in 1151, KT-RoUNDiNG assigns label i to v with 
probability x{v,i). Thus E[COST,nt] = COSTfrac- 

By LemmaO PARTfrac = Ee w{e)d{e). Therefore, by Lemmag^l E[PARTint] < A- PARTfrac- □ 

Proof of Theorem 1131 Let {Ai,--- , A^) be the partition returned by KT-RouNDiNG, and let PART,nt = 
Yli=i ^(^i) and COSTiNT = Yli=i 9i{^i)- Using the argument in the proof of Theorem 12.61 we can show that 
E[COSTint] < 0(lnn)COSTpRAc- Additionally, by LemmalM E[PART,nt] < APARTfrac- □ 

Now we turn our attention to the proof of Lemma |44] 

Proof of Lemma l44t Consider iteration i of KT- Rounding, and let and Oi be the label and threshold in 
iteration i. We say that iteration £ cuts e if Oi G [m.my^ex{v,ii),m.ayiy^ex{v,ii)]. We say that iteration £ 
touches e if Oi is in the interval [0, max„ge , i^)]. Let X£ and be the events that £ cuts and touches e 
(respectively). The probability that e is split in iteration £ is at most Pr[X^]/ Pr[Z£]. We have 



Pr[X^] < 7/ imaxx{v,i) 




i=l 



mina;(f , i) 1 < 



Me) 
k 
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where the last inequality follows from Lemma |3^ Additionally, 



1 ^ 1 
Vx\ZA = — > maxx(v,z) > — 



The last inequality follows from the fact that, for any vertex w € e, Ym=i x{w^i) = 1. It follows that the 
probability that e is split in iteration j is at most A(i(e). □ 

Proof of Theorem |4.3t Let and Oi be the label and 6 value chosen in the £-th iteration of the first while 
loop SymSubLabel-Rounding. For each z, let a = ^i:ii=i{v \ x{v,i) > 0e}. Let PARTballs = 
ELi /(A) and COST BALLS = X]i=i 9i{^i)- (Note that Ai is the set Ai at the end of the first while loop of 
SymSubLabel-Rounding.) 

Using the argument in the proof of Theorem 12.61 we can show that 

E[COSTballs] < 0(lnn)COSTpRAc 

and 

E[PARTballs] < 0(lnn)PARTpRAc 
Let {A[,--- , A'f^) be the partition returned by S ymSubLabel-Rounding. Let PARTjnt = Y^i=i fiA'^) and 
COST:nt = Ei=i9i{A'i). By LemmaEl 

E[PART,nt] < E[PARTballs] < 0(hin)PARTpRAc 

Since each gi is monotone, 

E[COST,ht] < E[COSTballs] < 0(lnn)COSTpRAc. 

□ 

Integrality Gap Example: We remark that we can generalize the integrality gap example of 115 ] in order to 
show that the integrality gap of LE-Rel is at least A(l — l/k) for Sub-Label when h is the hypergraph 
separation cost function, even if each gi is modular. 

Consider a A-uniform complete hypergraph on k vertices; all (^) edges are present, and each edge has unit 
weight. For each vertex i, the cost of assigning label j to i is zero if i ^ j, and infinity otherwise. 

It is easy to see that an optimal integral solution picks a label i and assigns label i to all vertices except i, 
and it assigns some other label to i. Thus the integral optimum is (^Zi)- Setting x{i,j) = 1/(A; — 1) for all 
i j gives us a fractional solution of cost (^) /{k — 1). 
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A Definition of tlie Lovasz extension 



Let / : {0, 1}" — > M be a function. The Lovasz extension / of / is the function / : [0, 1]" — M defined as 
follows. Let X be a vector in [0, 1]". We relabel the vertices as 1, 2, • • • , n so that a;i > a;2 > • • • > x^, for 
ease of notation, let xq = 1 and Xn+i = 0. Let S'j = {1, 2, ■ ■ ■ , i}. The value of / at x is equal to 

n 

/(x) = - Xi)f{Si) 

It is straightforward to verify that Yd=oi^i+i ~ ^O/C'S'i) = Kee[o,i][f i'^^)]- 

Another useful extension for a function / is its convex closure, which is defined as follows. For each set 
5 C y, we let Is denote the characteristic vector of S; that is, the i-th coordinate of I5 is equal to 1 if i is in 5* 
and otherwise. The convex closure / is the function : [0, 1]" M where /^(x) = ^^^ti{J2scv ^sf{S) : 
J2scv '^s^s = X, Ylscv = 1; -^5 ^ 0}- The Lovasz extension / of / is equal to the convex closure of 
/ iff / is submodular; see for instance ||6]. Using this result, we can show that LE-Rel can be solved in time 
that is polynomial in n and log (maxj ^cy fi{S)) via the ellipsoid method. 

Since / is equal to /~, we can write LE-Rel as follows. 



LE-REL-Primal 

k 

mill A(5,i)/,(5) 
i=i scv 

X{S,i) = x{v,i) Vf,i 

S:veS 

^A(5,i) = l Vz 

scv 

k 

x{v, i) = 1 

1=1 

x{s, i)>o ys, i 

x{v, i) > Vt;, i 



LE-REL-Primal is an LP with exponentially many variables and polynomially many constraints. Its dual 
LE-REL-Dual has polynomially many variables and exponentially many constraints. 



LE-REL-Dual 




k 

max A + 7t> 




1=1 vi^V 




Y.a{v,i)+P,<f,{S) 


ys,i 








\/v, i 



Separation oracle for LE-REL-Dual. Fix an assignment of values to the variables a, (3, 7 in LE-REL-Dual. 

It is easy to check in polynomial time whether 7^, < a{v,i) for all u, z since there are only nk such constraints. 
Let gi{S) = J2veS '^(^' + A- Note that gi is a modular function and therefore — is a submodular 
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function. Using a polynomial time algorithm for submodular function minimization, for a given i, we can 
check whether fi{S) - gi{S) > for all sets SCV. 

Therefore we can solve LE-REL-Dual in time that is polynomial in n and log (maxj ^cy fi{S)) using the 
ellipsoid method. Using standard techniques, we can also construct an optimal solution for the primal; we omit 
the details here. 



B Omitted proofs from Section |2] 

Proof of Proposition flAj Consider a terminal i. For any 6, A{i,9) is a subset of {v \ v ^ V,x{v,i) > 9}. 
Since /j is monotone, fi{A{i,6)) < fi{{v \ v € V,x{v,i) > 9}). Therefore 

E m,9)] = f U{A{i,9))d9 

< / h{{v\v eV,x{v,i)>9})d9 
Jo 



Finally, 



k 

i=l ' 



□ 



Proof of Proposition 12.5 1 Note that 



k 



We can prove by induction on the size of C/that^f^iEeg[o,i][|^(i,^)|] is equal to \U\. 

If U is empty, the claim trivially holds. Therefore we may assume that U contains at least one element z. 
Let U' = U — {z}, and let x' be the restriction of x to U'; more precisely, x'{v, i) = x{v, i) for all v G U' and 
all terminals i. Let A'{i,9) = {v \ v € U',x'{v,i) > 9}. Note that A'{i,9) is equal to A{i,9) - {z} if 9 is 
smaller than x{z, i), and A'{i, 9) is equal to A{i^ 9) otherwise. Therefore 



^ ' \A'{i,9)u{z}\de + "^ \A'{i,9)\de 

V ['\A'{i,9)\de + y2x{z,i) 

i=l -^0 i=l 
k I 

V / \A'{i,9)\de + i 

\U'\ + l (By induction) 

\U\ 



n 



18 



C Omitted proofs from Section |3] 



Theorem C.l. The integrality gap of LE-Rel /or Sym-Sub-MP is at most 2(1 -l/k). 

Proof: Let x be an optimal solution to LE-Rel for a given instance of Sym-Sub-MP. Without loss of general- 
ity, /(xfc) = maxj /(xj). Let ^4(1,6'), . . . , A{k — 1,9) be sets obtained by applying ^-rounding with respect to 
the first k — 1 terminals. By the property of 6'-rounding, we observe that '}2!i=i ^[f{-^{h ^))] = "^^=1 fi^i) ^ 
(1 - 1/A;)0PT 

FRAc- The last inequality follows from the fact that /(x^) > OPTprac/^- Note that Si belongs 
only to A{i, 9). We now apply Lemma O to ^(1, 6*), . . . , A{k - 1, 9) to obtain ^i, . . . , We have 

E'=i' fiA) < THzI f{Aii, 9)). Let A', = V- Ui<,<,_i4. By symmetry of /, f{A',) = f {U^<,<u-iA[) 
and, since / is sub-additive, /(A'J = /(Ui<i<fe_i^9 < EtiV(^-) < T!1=1 f{A{i,9)). We allocate A'^ 
to index k and the total cost of the allocation is upper bounded by f{A-[) < 2 Yli=i f{A{i, 9)). Thus the 
expected cost of the allocation is at most 2(1 — 1/A:)0PTfrac- The allocation is feasible since, for each i ^ k, 
Si belongs only to A{i, 9) and hence to A'^, and Sk belongs to A'j^. □ 



D Omitted proofs from Subsection I3A 



SymSubMP-Rounding 

let a; be a feasible solution to SubMP-Rel 
relabel the terminals so that /(x^) — max^ /(xi) 
pick 6 G [0,1] uniformly at random 

A{i, 0) {w I x{v, i) > 0} for each i{l <i <k - I) 
{{uncross A{1, 9),- ■■ ,A{k-l,e))) 
A[ ^ A{i, e) for each i{l<i<k~\) 
while there exist i ^ j such that A'^ O A'j 9 
if if {Ap + f{A'^-A[)<f{A'^ + f{A'^) 

Aj -Ir- Aj — 

else 

A\^A'^- A'j 
return {A'„ ■■■ , A'^_,,V - [A'^yj ■ ■ ■ A'^_,)) 



Theorem D.l. SymSubMP-Rounding achieves an (1.5 - l/k)-approximation for Hypergraph-MP. 

Lemma D.2. Let i* be the index such that the interval I{e, i*) has the rightmost ending point among the in- 
tervals I{e,i), where 1 < i < k — 1. More precisely, I{e,i*) is an interval such that maxy(zex{v,i*) = 
maxi<j<jt_i max^jge x{v, i); if there are several such intervals, we choose one arbitrarily. Let be an indi- 
cator random variable equal to 1 iffe £ 6{V — {A{1, 9) Li ■ ■ ■ L) A{k — 1, 9))). Then E[Ze] < c?(e, i*). 

Proof: Note that Zf. is equal to 1 only if (1) for any terminal i 7^ A;, is at least min^gg 3;(t', i) and (2) 
there exists a terminal I ^ k such that 9 e I{e,i). If there exists a terminal i ^ k such that 9 is smaller 
than min^ge x{v, i), all of the vertices of e are in A{i, 9). If there does not exist a terminal I ^ k such that 
9 G I{e,l), either all of the vertices of e are in A{i,9) for some i ^ k or all of the vertices of e are in 
V - {A{1, 6*) U ■ ■ ■ U A{k - 1, 9)). Finally, we note that (1) and (2) imply that 9 is in /(e, i*): by (2), 9 is at 
most max^,ge x{v, i*) and, by (1), 9 is at least mint,ge x{v, i*). □ 

Proof of Theorem ID.lt It follows from Lemma |3H that the cost of the partition returned by SymSubMP- 
ROUNDING is at most 



k-1 



f{A'^ + f{V - (A; U • • • U A',_,)) < f{A{i, 9)) + f{V - {A{1, 9)U---uA{k-l, 9))) 



i=l 



i=l 
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By Lemma ID^ and Lemma [3!4l 



E[f{V - {A{1, 9)U---UA{k-l, 9)))] < 



w{e)d{e) _ OPT, 
2 ~ 2 



FRAC 



e 



Since /(x/j) > OPTprac/^» we have 



E Yf{A{i,9)) < [l-i'joFT: 



FRAC 



Therefore the expected cost of the allocation is at most (1.5 — 1/A;)0PT 



FRAC- 



□ 



E Improved integrality gap bound for Hypergraph-MC 

The main goal of this section is to establish a connection between the distance LP for NODE-WT-MC considered 
in ||9l and LE-Rel for Hypergraph-MC. This connection gives us the following theorem. 

Theorem E.l. The integrality gaph of Hyper GRAPH-MC is at most 2{l— I /k) for instances of Hypergraph- 



The reader may wonder whether we can prove the above theorem directly without recourse to the result from 
||9l ; we believe that it can be done but there are some technical hurdles that we plan to address in a future version 
of the paper. As we already noted, Hypergraph-MC and Node-wt-MC are approximation equivalent. Oku- 
moto et al. [18] gave an approximation-preserving reduction from H YPERGRAPH-MC to NODE-WT-MC. The 
reduction maps an instance of Hypergraph-MC to an instance of NODE-WT-MC as follows. Let G = {V, E) 
be an instance of Hypergraph-MC, namely a hypergraph with weights on the edges and k distinguished ver- 
tices which we call terminals. We construct a bipartite graph H as follows. We add all of the vertices of G to 
H, and we give them infinite weight. For each hyperedge e of G, we add a vertex to H of weight We, and we 
connect it to all of vertices of V that are contained in the hyperedge. The terminals of H are the terminals of 
G, and it is straightforward to verify that a multiway cut in G corresponds to a node multiway cut in H of the 
same weight and vice-versa. 

We remark that there is an approximation-preserving reduction from NODE-WT-MC to Hypergraph-MC 
as well. Let G = {V, E) be an instance of NODE-WT-MC, namely a graph with weights on the vertices and k 
distinguished vertices called terminals. We may assume without loss of generality that the terminals form an 
independent set of G. We subdivide each edge of G, except the edges incident to the terminals. Now we can 
view the resulting graph G' as a bipartite graph with the terminals and the subdividing vertices on the left, and 
all other vertices on the right. We construct a hypergraph H as follows: the vertices of H are the left vertices 
of G' and, for each vertex v on the right, H has a hyperedge consisting of all neighbors of v (in the subdivided 
graph G'). The weight of the hyperedge corresponding to v is w{v). The terminals of H are the terminals of 
G, and it is straightforward to verify that a node multiway cut in G corresponds to a multiway cut in H of the 
same weight and vice-versa. 

Consider an instance G of Hypergraph-MC, and let x be a feasible solution to LE-Rel for this instance. 
Using the first reduction, we map an instance G of Hypergraph-MC to the instance H of NODE-WT-MC 
described above. In the following, we show that we can map the solution x to a solution d to the distance LP 
relaxation for H. The distance LP has a variable for each non-terminal v with the interpretation that is 1 
if V is in the node multiway cut. Let Vsi^Sj denote the set of all paths of H from si to Sj. 



MC. 
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(DiSTANCE-LP) 


mill d^Wy 

V 








> 1 
















dy > 


yv G V{H) - {si,-- 


• ,Sk} 



We map the solution x to LE-Rel to a solution d to DiSTANCE-LP as follows. For each vertex € 1^, we set 
dy = (recall that all the vertices in V have infinite weight). Let z be a vertex of H that corresponds to the 
hyperedge e of G. Let u be the representative of e, and let 



k 

E 

i=l 



x{u, i) — mill i)] = 1 — > min x( 



i=l 



Now consider a path p of H between two terminals Sa and Sh- Let p = vq — zi — vi — Z2 — • ■ ■ — Z£ — V£, 
where Zj is in V{H) — V , vj is in V, vq = Sa, and ve = Sb- Let ej be the hyperedge corresponding to zj; node 
that ej contains vj^i and Uj. For each vertex v G V, let = {x{v, 1), • • • , x{v, k)) denote the point on the 
fc-dimensional simplex to which v is mapped by the solution x, and let 1 1 • 1 1 1 denote the li norm of a vector. 



e / k \ 
> 1 — mm{x{vj-i,i), x{vj ,i)} (ej contains vj-i and Vj) 

j=i \ i=i ) 



I k 

- imn.{x{vj-i,i),x{vj,i)}) 

j=i i=i 

e k 

maxjO, i) — 

j=i i=i 



i=i i=i 



\x{Vj_l) - x{Vj)\ 



> ^\\x{Sa) - X{sb)\\l = 1 

Therefore d is a feasible solution to DiSTANCE-LP. Garg, Vazirany, and Yannakakis [9] showed that the in- 
tegrality gap of DiSTANCE-LP is at most 2(1 — l/k). Therefore the integrality gap of LE-Rel is at most 
2(1 — l/k) as well. The above argument also establishes that LE-Rel is at least as strong a relaxation as 
DiSTANCE-LP for NODE-WT-MC. Easy examples show that it is strictly stronger. 
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